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Abstract — Non-adaptive group testing involves grouping ar- 
bitrary subsets of n items into different pools. Each pool is 
then tested and defective items are identified. A fundamental 
question involves minimizing the number of pools required to 
identify at most d defective items. Motivated by applications in 
network tomography, sensor networks and infection propagation, 
a variation of group testing problems on graphs is formulated. 
Unlike conventional group testing problems, each group here 
must conform to the constraints imposed by a graph. For 
instance, items can be associated with vertices and each pool is 
any set of nodes that must be path connected. In this paper, a test 
is associated with a random walk. In this context, conventional 
group testing corresponds to the special case of a complete graph 
on n vertices. 

For interesting classes of graphs a rather surprising result is 
obtained, namely, that the number of tests required to identify 
d defective items is substantially similar to what is required 
in conventional group testing problems, where no such con- 
straints on pooling is imposed. Specifically, if T(n) corresponds 
to the mixing time of the graph G, it is shown that with 
m = 0(d 2 T (n) log(n/d)) non-adaptive tests, one can identify 
the defective items. Consequently, for the Erdos-Renyi random 
graph G(n,p), as well as expander graphs with constant spectral 
gap, it follows that m — 0(d 2 log 3 n) non-adaptive tests are 
sufficient to identify d defective items. Next, a specific scenario 
is considered that arises in network tomography, for which it is 
shown that m — 0(<f 3 log 3 n) non-adaptive tests are sufficient 
to identify d defective items. Noisy counterparts of the graph 
constrained group testing problem are considered, for which 
parallel results are developed. We also briefly discuss extensions 
to compressive sensing on graphs. 

Index Terms — Group testing, Sparse recovery, Network tomog- 
raphy, Sensor networks, Random walks. 

I. Introduction 
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IN this paper we introduce the graph constrained group test- 
ing problem motivated by applications in network tomogra- 
phy, sensor networks and infection propagation. While group 
testing theory (see (TJ, (2) and more recently Q), and its 
numerous applications, such as industrial quality assurance [4], 
DNA library screening [5], software testing |6), and multi- 
access communications [7 1, have been systematically explored, 
the graph constrained group testing problem is new to the best 
of our knowledge. 

Group testing involves identifying at most d defective items 
out of a set of n items. In non-adaptive group testing, which is 
the subject of this paper, we are given an m x n binary matrix, 
M, usually referred to as a test or measurement matrix. Ones 
on the jth row of M indicate which subset of the n items 
belongs to the jth pool. A test is conducted on each pool; a 
positive outcome indicating that at least one defective item is 
part of the pool; and a negative test indicating that no defective 
items are part of the pool. The conventional group testing 
problem is to design a matrix M with minimum number of 
rows m that guarantees error free identification of the defective 
items. While the best known (probabilistic) pooling design 
requires a test matrix with m = 0(d 2 \og(n/d)) rows, and an 
almost-matching lower bound of m = fl(d 2 (log n) / (logrf)) is 
known on the number of pools (cf. 12 Chapter 7]), the size 
of the optimal test still remains open. 

Note that in the standard group testing problem the test 
matrix M can be designed arbitrarily. In this paper we consider 
a generalization of the group testing problem to the case where 
the matrix M must conform to constraints imposed by a 
graph G = (V,E). In general, as we will describe shortly, 
such problems naturally arise in several applications such 
as network tomography (8), (5), sensor networks |10|, and 
infection propagation ifTTl . While the graph constrained group 
testing problem has been alluded to in these applications, the 
problem of test design or the characterization of the minimum 
number of tests, to the best of our knowledge, has not been 
addressed before. In this light our paper is the first to formalize 
the graph constrained group testing problem. In our graph 
group testing problem the n items are either vertices or links 
(edges) of the graph; at most d of them are defective. The 
task is to identify the defective vertices or edges. The test 
matrix M is constrained as follows: for items associated with 
vertices each row must correspond to a subset of vertices that 
are connected by a path on the graph; similarly, for items 
associated with links each row must correspond to links that 
form a path on G. The task is to design an m x n binary test 
matrix with minimum number of rows m that guarantees error 
free identification of the defective items. 

We will next describe several applications, which illustrate 
the graph constrained group testing problem. 
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Fig. 1. The route l->4->6^7^8-s>9is valid while 2 -+ 6 -» 5 is 
not. 

A. Network Tomography & Compressed Sensing over Graphs 

For a given network, identification of congested links from 
end-to-end path measurements is one of the key problems in 
network tomography J51, |8). In many settings of today's IP 
networks, there is one or a few links along the path which 
cause the packet losses in the path. Finding the locations of 
such congested links is sufficient for most of the practical 
applications. 

This problem can be understood as a graph-constrained 
group testing as follows. We model the network as a graph 
G = (V, E) where the set V denotes the network routers/hosts 
and the set E denotes the communication links (see Fig. [TJ. 
Suppose, we have a monitoring system that consists of one or 
more end hosts (so called vantage points) that can send and 
receive packets. Each vantage point sends packets through the 
network by assigning the routes and the end hosts. 

All measurement results (i.e., whether each packet has 
reached its destination) will be reported to a central server 
whose responsibility is to identify the congested links. Since 
the network is given, not any route is a valid one. A vantage 
point can only assign those routes which form a path in the 
graph G. The question of interest is to determine the number of 
measurements that is needed in order to identify the congested 
links in a given network. 

We primarily deal with Boolean operations on binary valued 
variables in this paper, namely, link states are binary valued 
and the measurements are boolean operations on the link 
states. Nevertheless, the techniques described here can be 
extended to include non-boolean operations and non-binary 
variables as well. Specifically, suppose there are a sparse 
set of links that take on non-zero values. These non-zero 
values could correspond to packet delays, and packet loss 
probabilities along each link. Measurements along each path 
provides aggregate delay or aggregate loss along the path. The 
set of paths generated by m random walks forms a m X \E\ 
routing matrix M. For an appropriate choice of m and graphs 
studied in this paper, it turns out (see [12]) that such routing 
matrices belongs to the class of so called expander matrices. 
These expander type properties in turn obey a suitable type 



of restricted-isometry -property (1-RIP) iTPTl . Such properties 
in turn are sufficient for recovering sparse vectors using 
£i optimization techniques. Consequently, the results of this 
paper have implications for compressed sensing on graphs. 

B. Sensor Networks 

The network tomography problem is further compounded 
in wireless sensor networks (WSN). As described in [ 10 1 the 
routing topology in WSN is constantly changing due to the 
inherent ad-hoc nature of the communication protocols. The 
sensor network is static with a given graph topology such as 
a geometric random graph. Sensor networks can be monitored 
passively or actively. In passive monitoring, at any instant, 
sensor nodes form a tree to route packets to the sink. The 
routing tree constantly changes unpredictably but must be 
consistent with the underlying network connectivity. A test 
is considered positive if the arrival time is significantly large, 
which indicates that there is at least one defective sensor node 
or a congested link. The goal is to identify defective links 
or sensor nodes based on packet arrival times at the sink. 
In active monitoring network nodes continuously calculate 
some high level, summarized information such as the average 
or maximum energy level among all nodes in the network. 
When the high level information indicates congested links, a 
low level and more energy consuming procedure is used to 
accurately locate the trouble spots. 

C. Infection Propagation 

Suppose that we have a large population where only a small 
number of people are infected by a certain viral sickness 
(e.g., a flu epidemic). The task is to identify the set of 
infected individuals by sending agents among them. Each 
agent contacts a pre-determined or randomly chosen set of 
people. Once an agent has made contact with an infected 
person, there is a chance that he gets infected, too. By the end 
of the testing procedure, all agents are gathered and tested for 
the disease. While this problem has been described in ifTTl . 
the analysis ignores the inherent graph constraints that need 
to be further imposed. It is realistic to assume that, once an 
agent has contacted a person, the next contact will be with 
someone in close proximity of that person. Therefore, in this 
model we are given a random geometric graph that indicates 
which set of contacts can be made by an agent (see Fig. |2j. 
Now, the question is to determine the number of agents that 
is needed in order to identify the set of infected people. 

These applications present different cases where graph con- 
strained group testing can arise. However, there are important 
distinctions. In the wired network tomography scenario the 
links are the items and each row of the matrix M is associated 
with a route between any two vantage points. A test is positive 
if a path is congested, namely, if it contains at least one 
congested link. Note that in this case since the routing table 
is assumed to be static, the route between any two vantage 
points is fixed. Consequently, the matrix M is deterministic 
and the problem reduces to determining whether or not the 
matrix M satisfies identifiability. 
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Fig. 2. Collective sampling using agents. The Cg) symbols represent infected 
people whereas the healthy population is indicated by • symbols. The dashed 
lines show the group of people contacted by each agent 1111 . 



Our problem is closer in spirit to the wireless sensor network 
scenario. In the passive case the links are the items and each 
row of the matrix M is associated with a route between 
a sensor node and the sink. A test is positive if a path is 
congested, namely, if it contains at least one congested link. 
Note that in this case since the routing table is constantly 
changing, the route between a sensor node and the sink is 
constantly changing as well. Nevertheless the set of possible 
routes must be drawn from the underlying connectivity graph. 
Consequently, the matrix M can be assumed to be random 
and the problem is to determine how many different tests are 
required to identify the congested links. Note that, in contrast 
to the wired scenario, tests conducted between the same sensor 
node and sink yields new information here. A similar situation 
arises in the active monitoring case as well. Here one could 
randomly query along different routes to determine whether or 
not a path is congested. These tests can be collated to identify 
congested links. Note that in the active case the test matrix 
M is amenable to design in that one could selectively choose 
certain paths over others by considering weighted graphs. 

Motivated by the WSN scenario we describe pool designs 
based on random walks on graphs. As is well known a random 
walk is the state evolution on a finite reversible Markov chain. 
Each row of the binary test matrix is derived from the evolution 
of the random walk, namely, the ones on the jth row of M 
correspond to the vertices visited by the jth walk. This is close 
to the WSN scenario because as in the WSN scenario the 
path between two given nodes changes randomly. We develop 
several results in this context. 

First, we consider random walks that start either at a random 
node or an arbitrary node but terminate after some appro- 
priately chosen number of steps t. By optimizing the length 
of the walk we arrive at an interesting result for important 
classes of graphs. Specifically we show that the number of 
tests required to identify d defective items is substantially 
similar to that required in conventional group testing problems, 
except the fact that an extra term appears which captures the 
topology of the underlying graph. The best known result for 
the number of tests required when no graphical constraints are 
imposed scales as 0(d 2 \og(n/d)). For the graph constrained 
case we show that with m = 0(d 2 T 2 (n) log(n/d)) non- 



adaptive tests one can identify the defective items, where T{n) 
corresponds to the mixing time of the underlying graph G. 
Consequently, for the Erdos-Renyi random graph G(n,p) with 
p = il((log 2 n)/n), as well as expander graphs with constant 
spectral gap, it follows that m = 0(d 2 log 3 n) non-adaptive 
tests are sufficient to identify d defective items. In particular, 
for a complete graph where no pooling constraint is imposed, 
we have T(n) = 1, and therefore, our result subsumes the 
well-known result for the conventional group testing problem. 

Next we consider unbounded-length random walks that 
originate at a source node and terminate at a sink node. Both 
the source node and the sink node can either be arbitrary 
or be chosen uniformly at random. This directly corresponds 
to the network tomography problem that arises in the WSN 
context. This is because the source nodes can be viewed 
as sensor nodes, while the sink node maybe viewed as the 
fusion center, where data is aggregated. At any instant, we 
can assume that a random tree originating at the sensor nodes 
and terminating at the sink is realized. While this random tree 
does not have cycles, there exist close connections between 
random walks and randomly generated trees. Indeed, it is well 
known that the so called loop-erased random walks, obtained 
by systematically erasing loops in random walks, to obtain 
spanning trees, is a method for sampling spanning trees from 
a uniform distribution 1141 , In this scenario, we show that 
m = 0(d 3 log 3 n) non-adaptive tests are sufficient to identify 
d defective items. By considering complete graphs we also 
establish that the cubic dependence on d in this result cannot 
be improved. 

We will also consider noisy counterparts of the graph 
constrained group testing problem, where the outcome of each 
measurement may be independently corrupted (flipped) with 
probability^ < q < 1/2. We develop parallel results for 
these cases. In addition to a setting with noisy measurement 
outcomes, these results can be used in a so called dilution 
model (as observed in Q, ATI '). In this model, each item can 
be diluted in each test with some a priori known probability. 
In a network setting, this would correspond to the case where 
a test on a path with a congested link can turn out to be 
negative with some probability. We show that similar scaling 
results holds for this case as well. 

Other group testing problems on graphs: Several variations 
of classical group testing have been studied in the literature 
that possess a graph theoretic nature. A notable example is 
the problem of learning hidden sparse subgraphs (or more 
generally, hypergraphs), defined as follows (cf. 031): Assume 
that, for a given graph, a small number of the edges are 
marked as defective. The problem is to use a small number 
of measurements of the following type to identify the set 
of defective edges: Each measurement specifies a subset of 
vertices, and the outcome would be positive iff the graph 
induced on the subset contains a defective edge. Another 
variation concerns group testing with constraints defined by a 
rooted tree. Namely, the set of items corresponds to the leaves 
of a given rooted tree, and each test is restricted to pool all the 

'It is clear that if q > 1/2, one can first flip all the outcomes, and then 
reduce the problem to the q < 1/2 regime. For q = 1/2, since we only 
observe purely random noise, there is no hope to recover from the errors. 
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leaves that descend from a specified node in the tree (see 
Chapter 12]). To the best of our knowledge, our work is the 
first variation to consider the natural restriction of the pools 
with respect to the paths on a given graph. 

The rest of this paper is organized as follows. In Section |H] 
we introduce our notation and mention some basic facts related 
to group testing and random walks on graphs. Section |Hl| 
formally describes the problem that we consider and states 



our main results. In Section IV we prove the main results 



and finally, in Section[V]show instantiations of the result to the 
important cases of graph-constrained group testing on regular 
expander graphs and random graphs in the Erdos-Renyi model. 

II. Definitions and Notation 

In this section we introduce some tools, definition and 
notations which are used throughout the paper. 

Definition 1. For two given boolean vectors S and T of 
the same length we denote their element-wise logical or by 
S V T. More generally, we will use Vi=i to denote the 
element-wise or of d boolean vectors Si,..., Sd- The logical 
subtraction of two boolean vectors S — (si,...,s n ) and 
T = (<!,..., i„), denoted by S \T, is defined as a boolean 
vector which has a 1 at position i if and only if Sj = 1 and 
ti = 0. We also use \S\ to show the number of l's in (i.e., the 
Hamming weight of) a vector S. 

We often find it convenient to think of boolean vectors 
as characteristic vectors of sets. That is, x e {0, 1}" would 
correspond to a set X C [n] (where [n] := {1, . . . , n}) such 
that i 6 X iff the entry at the ith position of x is 1. In this 
sense, the above definition extends the set-theoretic notions of 
union, subtraction, and cardinality to boolean vectors. 

Matrices that are suitable for the purpose of group testing 
are known as disjunct matrices. The formal definition is as 
follows. 

Definition 2. An m X n boolean matrix M is called d- 
disjunct, if, for every column So and every choice of d columns 
Si,...,Sd of M (different from So), there is at least one 
row at which the entry corresponding to Sq is 1 and those 
corresponding to Si, . . . , Sd are all zeros. More generally, for 
an integer e > 0, the matrix is called (d, e)-disjunct if for 
every choice of the columns Si as above, they satisfy 



\s \ S/Si 



> e. 



i=l 



A (d, 0)-disjunct matrix is said to be d-disjunct. 

A classical observation in group testing theory states that 
disjunct matrices can be used in non-adaptive group testing 
schemes to distinguish sparse boolean vectors (cf. 0). More 
precisely, suppose that a <i-disjunct matrix M with n columns 
is used as the measurement matrix; i.e., we assume that the 
rows of M are the characteristic vectors of the pools defined 
by the scheme. Then, the test outcomes obtained by applying 
the scheme on two distinct d-sparse vectors of length n must 
differ in at least one position. More generally, if M is taken 
to be (d, e)-disjunct, the test outcomes must differ in at least 



e+1 positions. Thus, the more general notion of (d, e)-disjunct 
matrices is useful for various "noisy" settings, where we are 
allowed to have a few false outcomes (in particular, up to 
L(e— 1)/2J incorrect measurement outcomes can be tolerated 
by (d, e)-disjunct matrices without causing any confusion). 

For our application, sparse vectors (that are to be distin- 
guished) correspond to boolean vectors encoding the set of 
defective vertices (or edges) in a given undirected graph. The 
encoding is such that the coordinate positions are indexed by 
the set of vertices (edges) of the graph and a position contains 
1 iff it corresponds to a defective vertex (edge). Moreover, we 
aim to construct disjunct matrices that are also constrained to 
be consistent with the underlying graph. 

Definition 3. Let G — (V, E) be an undirected graph, and 
A and B be boolean matrices with |V| and \E\ columns, 
respectively. The columns of A are indexed by the elements 
of V and the columns of B are indexed by the elements of 
E. Then, 

• The matrix A is said to be vertex-consistent with G if 
each row of A, seen as the characteristic vector of a subset 
of V, exactly represents the set of vertices visited by some 
walk on G. 

• The matrix B is said to be edge-consistent with G if each 
row of B, seen as the characteristic vector of a subset of 
E, exactly corresponds to the set of edges traversed by a 
walk on G. 

Note that the choice of the walk corresponding to each row 
of A or B need not be unique. Moreover, a walk may visit a 
vertex (or edge) more than once. 

Definition 4. An undirected graph G — (V,E) is called 
(D, c) -uniform, for some c > 1, if the degree of each vertex 
v £ V (denoted by deg(v)) is between D and cD. 

Definition 5. The point-wise distance of two probability 
distributions fi, // on a finite space ft is defined as 



IM- A* Hoc 



max \u(i) 



where p,(i) (resp., fi'(i)) denotes the probability assigned by 
/i (resp., p') to the outcome i 6 fl. We say that the two 
distributions are <5-close if their point-wise distance is at most 
S. 

For notions such as random walks, stationary distribution 
and mixing time we refer to many text books on probability 
theory, Markov chains, and randomized algorithms. In partic- 
ular for an accessible treatment of the basic notions, see lfl6l 
Chapter 6] or ifTTl Chapter 7]. The particular variation of the 
mixing time that we will use in this work is defined with 
respect to the point-wise distance as follows. 

Definition 6. Let G = (V, E) with \V\ =nbea (D,c)- 
uniform graph and denote by /i its stationary distribution. For 
v G V and an integer r, denote by [l t u the distribution that 
a random walk of length r starting at v ends up at. Then, 
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the S-mixing time of G (with respect to the 1^ normal is the 
smallest integer t such that — p,\\oo < <^ f° r Vt > £ and 
Vt> € V. For concreteness, we define the quantity T(n) as the 
5-mixing time of G for J := (l/2cn) 2 . 

Throughout this work, the constraint graphs are considered 
to be (D, c)-uniform, for an appropriate choice of D and some 
(typically constant) parameter c. When c = 1, the graph is ir- 
regular. 

For a graph to have a small mixing time, a random walk 
starting from any vertex must quickly induce a uniform 
distribution on the vertex set of the graph. Intuitively this 
happens if the graph has no "bottle necks" at which the walk 
can be "trapped", or in other words, if the graph is "highly 
connected". The standard notion of conductance, as defined 
below, quantifies the connectivity of a graph. 

Definition 7. Let G = (V, E) be a graph on n vertices. For 
every S C V, define_ A(5) := J2 v es de s0)> S := V \ S, 
and denote by E(S, S) the number of edges crossing the cut 
defined by S and its complement. Then the conductance of G 
is defined by the quantity 

$((?):= mm — ttt^- 

SCV : A(S)<|-E| A(S) 

We also formally define two important classes of graphs, 
for which we will specialize our results. 

Definition 8. Take a complete graph on n vertices, and remove 
edges independently with probability 1 — p. The resulting 
graph is called the Erdos-Renyi random graph, and denoted 

by G(n,p). 



Definition 9. For a graph G = (V,E) with \V\ 
(edge) expansion of G is defined as 

E(S,S) 



n, the 



KG) 



mm 

SCV: 0<|S|<§ 
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A family Q of D-regular graphs is called an (edge) expander 
family if there exists a constant a > such that h(G) > a for 
each G £ Q. In particular each G £ Q is called an expander 
graph. 

For a general study of Erdos-Renyi random graphs and their 
properties we refer to the fascinating book of Bollobas |18|. 
For the terminology on expander graphs, we refer the reader 
to the excellent survey by Hoory, Linial and Wigderson |19|. 

Definition 10. Consider a particular random walk W := 
(vo,vi, . . . ,v t ) of length ( on a graph G — (V,E), where 
the random variables V{ £ V denote the vertices visited by the 
walk, and form a Markov chain. We distinguish the following 
quantities related to the walk W: 

• For a vertex v £ V (resp., edge e £ E), denote by ir v 
(resp., 7r e ) the probability that W passes v (resp., e). 

• For a vertex veV (resp., edge e £ E) and subset A C 
V, v ^ A (resp., B C E, e ^ B), denote by tt v< a (resp., 

2 Note that the mixing time highly depends on the underlying distance by 
which the distance between two distributions is quantified. In particular, we 
are slightly deviating from the more standard definition which is with respect 
to the variation (£i) distance (see, e.g., 1171 Definition 11.2]). 



7r e ,s) the probability that W passes v but none of the 
vertices in A (resp., passes e but none of the edges in 
B). 

Note that these quantities are determined by not only 
v,e,A, B (indicated as subscripts) but they also depend on 
the choice of the underlying graph, the distribution of the 
initial vertex vq and length of the walk t. However, we find 
it convenient to keep the latter parameters implicit when their 
choice is clear from the context. 

In the previous definition, the length of the random walk 
was taken as a fixed parameter t. Another type of random 
walks that we consider in this work have their end points as 
a parameter and do not have an a priori fixed length. In the 
following, we define similar probabilities related to the latter 
type of random walks. 

Definition 11. Consider a particular random walk W := 
(vo,vi, . . . ,u) on a graph G = (V,E) that continues until 
it reaches a fixed vertex u G V. We distinguish the following 
quantities related to W: For a vertex v € V (resp., edge e £ E) 
and subset A C V, v ^ A (resp., B C E, e £ B), denote by 
tt^ a (resp., ir e g) the probability that W passes v but none of 
the vertices in A (resp., passes e but none of the edges in B). 

Again these quantities depend on the choice of G and the 
distribution of vq that we will keep implicit. 

III. Problem setting and Main Results 

Problem Statement. Consider a given graph G = (V, E) in 
which at most d vertices (resp., edges) are defective. The goal 
is to characterize the set of defective items using a number 
of measurements that is as small as possible, where each 
measurement determines whether the set of vertices (resp., 
edges) observed along a path on the graph has a non-empty 
intersection with the defective set. We call the problem of 
finding defective vertices vertex group testing and that of 
finding defective edges edge group testing. 

As mentioned earlier, not all sets of vertices can be grouped 
together, and only those that share a path on the underlying 
graph G can participate in a pool (see Fig. [3}. 

In the following, we introduce four random constructions 
(designs) for both problems. The proposed designs follow the 
natural idea of determining pools by taking random walks on 
the graph. 



Design 1. 



Given: a constraint graph G = (V,E) with r > 
designated vertices si, . . . ,s r £ V, and integer 
parameters m and t. 
Output: an m x \V\ boolean matrix M. 
Construction: Construct each row of M inde- 
pendently as follows: Let v £ V be any of 
the designated vertices s i; or otherwise a vertex 
chosen uniformly at random from V. Perform a 
random walk of length t starting from v, and let 
the corresponding row of M be the characteristic 
vector of the set of vertices visited by the walk. 
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Fig. 3. The result of pool 1 is positive since it contains a defective item, 
whereas the result of pool 2 is negative since it does not contain a defective 
item. Pool 3 is not consistent with the graph and thus not allowed since the 
items are not connected by a path. 



Design 2. 

Given: a constraint graph G = (V, E) and 
integer parameters m and t. 
Output: an m x \E\ boolean matrix M. 
Construction: Construct each row of M inde- 
pendently as follows: Let v £ V be any arbitrary 
vertex of G. Perform a random walk of length 
t starting from v, and let the corresponding row 
of M be the characteristic vector of the set of 
edges visited by the walk. 



Design 3. 

Given: a constraint graph G = (V, E) with r > 
designated vertices s\, . . . , s r € V, a sink node 
u 6 V, and integer parameter m. 
Output: an m x \V\ boolean matrix M. 
Constructions: Construct each row of M inde- 
pendently as follows: Let v G V be any of 
the designated vertices Sj, or otherwise a vertex 
chosen uniformly at random from V. Perform 
a random walk starting from v until we reach 
it, and let the corresponding row of M be the 
characteristic vector of the set of vertices visited 
by the walk. 



Design 4. 

Given: a constraint graph G = (V, E), a sink 
node u E V, and integer parameter m. 
Output: an m x \E\ boolean matrix M. 
Construction: Construct each row of M indepen- 
dently as follows: Let v € V be any arbitrary 
vertex of G. Perform a random walk, starting 
from v until we reach u, and let the correspond- 
ing row of M be the characteristic vector of the 
set of edges visited by the walk. 



TABLE I 

The asymptotic values of various parameters in Theorem[T3"1 



Puriimctcr 




D 


0(c^dT' 2 (n)) 


mi , mi 


0(c 4 ^T^(n)log(n/d)) 


mz 


0(c s ci 3 T 4 (n)log(n/(f)) 


7714 


0(c 9 d 3 L>T 4 (n)log(n/(2)) 


h 


0{n/(c 6 dT(n))) 




0(nD/(c 3 (iT(7i))) 


ei, e 2 , e 3 , e 4 


n(r?<21og(n/d)/(l - rj)^) 


m' t ,i e [4] 


0(m,/(l-^) 



By construction, Designs 1 and 3 (resp., Designs 2 and 4) 
output boolean matrices that are vertex- (resp., edge-) con- 
sistent with the graph G. Our main goal is to show that, 
when the number of rows m is sufficiently large, the output 
matrices become d-disjunct (for a given parameter d) with 
overwhelming probability. 

Remark 12. Designs 1 and 3 in particular provide two choices 
for constructing the measurement matrix M. Namely, the start 
vertices can be chosen within a fixed set of designated vertices, 
or, chosen randomly among all vertices of the graph. As 
we will see later, in theory there is no significant difference 
between the two schemes. However, for some applications it 
might be the case that only a small subset of vertices are 
accessible as the starting points (e.g., in network tomography 
such a subset can be determined by the vantage points), and 
this can be modeled by an appropriate choice of the designated 
vertices in Designs 1 and 3. 

The following theorem states the main result of this work, 
showing that our proposed designs indeed produce disjunct 
matrices that can be used for the purpose of graph-constrained 
group testing. We will state both noiseless results (correspond- 
ing to rf-disjunct matrices), and noisy ones (corresponding to 
(d, e)-disjunct ones, where the noise tolerance e depends on a 
fixed "noise parameter" 77 G [0, 1)). The proof of the following 
theorem is given in Section [PV] 

Theorem 13. Let rj > be a fixed parameter, and suppose 
that G = (V, E) is a (D, c)-uniform graph on n vertices with 
S-mixing time T(n) (where S := (l/2cn) 2 j. Then there exist 
parameters with asymptotic values given in Table [7] such that, 
provided that D > Dq, 

1) Design 1 with the path length t := t\ and the number 
of measurements m := mi outputs a matrix M that is 
vertex-consistent with G. Moreover, once the columns of 
M corresponding to the designated vertices Si, . . . , s r 
are removed, the matrix becomes d-disjunct with proba- 
bility 1 — o(l). More generally, for m := m\ the matrix 
becomes (d, e\)-disjunct with probability 1 — o(l). 

2) Design 2 with path length t := ti and m := 777,2 mea- 
surements outputs a matrix M that is edge-consistent 
with G and is d-disjunct with probability 1 — o(l). More 
generally, for m := m' 2 the matrix becomes (d, e.2)- 
disjunct with probability 1 — o(l). 

3) Design 3 with the number of measurements m := 7773 
outputs a matrix M that is vertex-consistent with G. 
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Moreover, once the columns of M corresponding to the 
designated vertices s%, . . . ,s r and the sink node u are 
removed, the matrix becomes d-disjunct with probability 
1 — o(l). More generally, for m := 7773 the matrix 
becomes (d, e^) -disjunct with probability 1 — o(l). 
4) Design 4 with the number of measurements m := 7774 
outputs a matrix M that is edge-consistent with G and 
is d-disjunct with probability 1 — o(l). More generally, 
for m := 7774 the matrix becomes (d, ei)-disjunct with 
probability 1 — o(l). 

Remark 14. In Designs 1 and 3, we need to assume that the 
designated vertices (if any) are not defective, and hence, their 
corresponding columns can be removed from the matrix M. 
By doing so, we will be able to ensure that the resulting matrix 
is disjunct. Obviously, such a restriction cannot be avoided 
since, for example, M might be forced to contain an all-ones 
column corresponding to one of the designated vertices and 
thus, fail to be even 1-disjunct. 

Remark 15. By applying Theorem [13] on the complete graph 
(using Design 1), we get 0(d 2 log(n/d)) measurements, since 
in this case, the mixing time is T(n) = 1 and also c = 1. 
Thereby, we recover the trade-off obtained by the probabilistic 
construction in classical group testing (note that classical 
group testing corresponds to graph-constrained group testing 
on the vertices of the complete graph). 

We will show in Section [V] that, for our specific choice 
of S := (l/2cn) 2 , the ^-mixing time of an Erdos-Renyi 
random graph G(n,p) is (with overwhelming probability) 
T(n) = 0(\ogn). This bound more generally holds for any 
graph with conductance £1(1), and in particular, expander 
graphs with constant spectral gap. Thus we have the following 
result (with a summary of the achieved parameters given in 
Table 0. 

Theorem 16. There is an integer Dq = 0(<ilog 2 n) such that 
for every D > Dq the following holds: Suppose that the graph 
G is either 

1) A D-regular expander graph with normalized second 
largest eigenvalue (in absolute value) A that is bounded 
away from 1; i.e., A = 1 — 0(1), or, 

2) An Erdos-Renyi random graph G(n,D /n). 

Then for every 77 € [0, 1), with probability 1 — o(l) Designs 
1, 2, 3, and 4 output (ei, e)-disjunct matrices (not considering 
the columns corresponding to the designated vertices and the 
sink in Designs 1 and 3), for some e — O^dlogn), using 
respectively mi, 7722,7713,7714 measurements, where 7711,7772 = 
0(d 2 (log 3 n)/(l - T?) 2 ), m 3 = 0(d 3 (log 5 n)/(l - T]) 2 ), and 
m 4 = 0(d 3 L»(log 5 n)/(l - 77) 2 ). 



TABLE II 

The asymptotic values of the bounds achieved by Theorem[T61 



The fixed-input case. Recall that, as Theorem 13 shows, 
our proposed designs almost surely produce disjunct matrices 
using a number of measurements summarized in Table [I] 
Thus, with overwhelming probability, once we fix the resulting 
matrix, it has the combinatorial property of distinguishing 
between any two o?-sparse boolean vectors (each corresponding 
to a set of up to d defective vertices, not including designated 
ones, for Designs 1 and 3, or up to d defective edges for 



Parameter 


Value 


D 


0(dlog 2 n) 


mi , rri2 


0(^(log d n)/(l-p) 2 ) 


"13 


0{d 3 (log b n)/(l- P ) 2 ) 


rru 


0(d 3 D{log b n)/(l -p) 2 ) 


e 


£l(r)d log n) 



Designs 2 and 4) in the worst case. However, the randomized 
nature of our designs can be used to our benefit to show 
that, practically, one can get similar results with a number of 
measurements that is almost by a factor d smaller than what 
required by Theorem [T3] Of course, assuming a substantially 
lower number of measurements, we should not expect to obtain 
disjunct matrices, or equivalently, to be able to distinguish 
between any two sparse vectors in the worst case. However, 
it can be shown that, for every fixed d-sparse vector x, the 
resulting matrix with overwhelming probability will be able to 
distinguish between x and any other d-sparse vector using a 
lower number of measurements. In particular, with overwhelm- 
ing probability (over the choice of the measurements), from the 
measurement outcomes obtained from x, it will be possible to 
uniquely reconstruct x. More precisely, it is possible to show 



the following theorem, as proved in Section IV 



Theorem 17. Consider the assumptions of Theorem 13 and 
let 7 := (logn)/(dlog(n/d)). Consider any fixed set of up to 
d vertices S C V such that \S\ < d and S D {si, . . . , s r } = 
and any fixed set of up to d edges T C E, \T\ < d. Then with 
probability 1 — o(l) over the randomness of the designs the 
following holds. 

Let M\ , . . . M4 respectively denote the measurement matri- 
ces produced by Designs 1, . . . ,4 with the number of rows set 
to 0(7777^)- Then for every S' C V and every T' C E such 
that S' ^ S, T' -£T and \S'\ < d, S' H {si, . . . , s r } = 0, 
\T'\ < d, we have that 

1) The measurement outcomes of Mi on S and S' (resp., 
M3 on S and S') differ at more than il^yei) (resp., 
0(763)) positions. 

2) The measurement outcomes of M2 on T and T' ( resp., 
Mi on T and T') differ at more than 0(762) (resp., 
0(764)) positions. 

A direct implication of this result is that (with overwhelming 
probability), once we fix the matrices obtained from our 
randomized designs with the lowered number of measurements 
(namely, having 0(7777^) m O^n^/d) rows), the fixed matrices 
will be able to distinguish between almost all pairs of d-sparse 
vectors (and in particular, uniquely identify randomly drawn d- 
sparse vectors, with probability 1— o(l) over their distribution). 
Example in Network Tomography. Here we illustrate a sim- 
ple concrete example that demonstrates how our constructions 
can be used for network tomography in a simplified model. 
Suppose that a network (with known topology) is modeled by 
a graph with nodes representing routers and edges representing 
links that connect them, and it is suspected that at most d links 
in the network are congested (and thus, packets routed through 
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them are dropped). Assume that, at a particular "source node" 
s, we wish to identify the set of congested links by distributing 
packets that originate from s in the network. 

First, s generates a packet containing a time stamp t 
and sends it to a randomly chosen neighbor, who in turn, 
decrements the time stamp and forwards the packet to a 
randomly chosen neighbor, etc. The process continues until 
the time stamp reaches zero, at which point the packet is 
sent back to s along the same path it has traversed. This 
can be achieved by storing the route to be followed (which is 
randomly chosen at s) in the packet. Alternatively, for practical 
purposes, instead of storing the whole route in the packet, s 
can generate and store a random seed for a pseudorandom 
generator as a header in the packet. Then each intermediate 
router can use the specified seed to determine one of its 
neighbors to which the packet has to be forwarded. 

Using the procedure sketched above, the source node gen- 
erates a number of independent packets, which are distributed 
in the network. Each packet is either returned back to s in 
a timely manner, or, eventually do not reach s due to the 
presence of a congested link within the route. By choosing 
an appropriate timeout, s can determine the packets that are 
routed through the congested links. 

The particular scheme sketched above implements our De- 



sign 2, and thus Theorem 13 implies that, by choosing the 
number of hops t appropriately, after generating a sufficient 
number of packets (that can be substantially smaller than the 
size of the network), s can determine the exact set of congested 
links. This result holds even if a number of the measurements 
produce false outcomes (e.g., a congested link may neverthe- 
less manage to forward a packet, or a packet may be dropped 
for reasons other than congestion), in which case by estimating 
an appropriate value for the noise parameter p in Theorem 13 
and increasing the number of measurements accordingly, the 
source can still correctly distinguish the congested links. Of 
course one can consider different schemes for routing the test 
packets. For example, it may be more desirable to forward 
the packets until they reach a pre-determined "sink node", an 
approach that is modeled by our Designs 3 and 4 above. 

IV. Proof of Theorems[T31and[T71 

Before discussing Theorem [13] and its proof, we introduce 
some basic propositions that are later used in the proof. The 
omitted proofs will be presented in the appendix. Throughout 
this section, we consider an underlying graph G = (V, E) that 
is (D, c)-uniform, with mixing time T(n) as in Definition [6] 

Proposition 18. Let A, Bi, B2, ■ ■ ■ , B n be events on a finite 
probability space, define B := Uf =1 Bi, and suppose that: 

1) For every i £ [n], Pr[A | Bj\ < e. 

2) For every set S C [n] with \S\ > k, r\ ie sBi = 0. 
Then, Pr[A | B] < ek. 



The proof of this proposition may be found in Section VI-A 



The following proposition is a corollary of a well-known 
result for the stationary distribution of irregular graphs [17, 
Theorem 7.13]. A formal proof of this proposition is given in 
Section IVLBl 



Proposition 19. Let G = (V, E) be a (D, c)-uniform graph, 
and denote by p the stationary distribution of G (assuming that 
G is not bipartite). Then for each v € V, 1/cn < p(v) < c/n. 



Proposition 20. For the quantities ir v and ir e in Definition 10 
we have 



( t 



7T e = il 



t 



\cnT(n) J ' \cDnT(n) 
The proof of this Proposition [20] is presented in Sec- 



tion 



VI-C In fact, a stronger statement than this proposition 



can be obtained, that with noticeable probability, every fixed 
vertex (or edge) is hit by the walk at least once but not too 
many times nor too "early". This is made more precise in the 
following two propositions, which are proved in Sections[VI-D| 



and VI-E respectively. 



Proposition 21. Consider any walk W in Design 1 (resp., 
Design 2). There is a k — 0(c 2 T(n)) such that, for every 
v G V and every e € E, the probability that W passes v 
(resp., e) more than k times is at most 7r„/4 (resp., ir e /4). 

Proposition 22. For any random walk W in Design 1, let 
v £ V be any vertex that is not among the designated vertices 
8\, ... , s r . Then the probability that W visits v within the first 
k steps is at most k/D. 

The following proposition shows that the distributions of 
two vertices on a random walk that are far apart by a 
sufficiently large number of steps are almost independent. The 



proof of this proposition may be found in Section VI-F 

Proposition 23. Consider a random walk w := 
(vo, V\, . . . , v-t) on G starting from an arbitrary vertex, 
and suppose that j > i + T(n). Let £ denote any event that 
only depends on the first i vertices visited by the walk. Then 
for every V, 

I Pr[ Vi = u\vj =v,S]- Pr[vi = u\S]\ < 2/(3cn). 

The following lemmas, which form the technical core of 

(u) (u) 

this work, lower bound the quantities ir v a, ^e,B, n v n e g 
as defined by Definitions [10] and [TT] 

Lemma 24. There is a Dq = 0(c 2 dT 2 (n)) and t\ = 
0(n/(c dT(n))) such that whenever D > Dq, by setting the 
path lengths t := t\ in Design 1 the following holds. Let 
v £ V, and A C V be a set of at most d vertices in G such 
that v (£ A and Al_){v} does not include any of the designated 
vertices S\, . . . s r . Then 

Proof: Denote by p the stationary distribution of G. We 



know from Proposition 19 that for each u € V, 1/cn < p u < 
c/n. 



21 



Let k = 0(c 2 T(n)) be the quantity given by Proposition 
B denote the bad event that W hits some vertex in A. 
Moreover, let Q denote the good event that W hits v no more 
than k times in total and never within the first 2T(n) steps. 
The probability of Q is, by Propositions [2T] and [22] at least 

Pr(£) > 1 - 2T(n)/D - 0(t/cnT(n)), 
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which can be made arbitrarily close to 1 (say larger than 0.99) 
by choosing D sufficiently large and t sufficiently small (as 
required by the statement). Now, 



In order to bound m, we observe that of all D or more 
neighbors of Vi, at most d can lie on A. Therefore, 



7r UlJ i=PrHB,i;eW] 
> Pr[-vB,i; e W,G] 
= Pt[v G W,G](1-Pt[B 



eW,G}). 



(2) 



By taking D large enough, and in particular, D 
il(c 2 dT 2 (n)), we can ensure that 

2T(n)/D < tt v /4:. 



Combined with Proposition 21 we have Pr[t> G W,G] > 
ir v /2, since 

Pr[v EW,G} = Pi[v £W}+ Pr[0] - Pr[(« G W) U G] 

> 7T V + (1 - 7T„/2) - 1 = 7T v /2. 

Thus, Q gives 

Tr v , A >ir v (l-Pr[13\v€W,G})/2. (3) 

Now we need to upperbound n := Pi[B \ v G W,G]. Before 
doing so, fix some i > 2T(n), and assume that v. L = v. 
Moreover, fix some vertex u A and assume that vq = u. 
We first try to upperbound Pi[B \ Vi = v, vq = u]. 

Let i := % — T(n) and p := i + T(n), and for the moment, 
assume that T(n) + l<£<p<t(a "degenerate" situation 
occurs when this is not the case). Partition W into four parts: 

Wi := (v ,v 1 , . . .,v T ( n )), 

W 2 := (Vr(7i)+X»«T(n)+2) • • • ) V£-i), 

W 3 := (vt,vi + i, . . .,v p ), 
W 4 := {v p+1 ,v p+2 , . . .,v t ). 

For j = 1,2,3,4, define 

7Tj := Pr[Wj- enters A | Vi = v, Vo = u]. 

Now we upperbound each of the ttj. In a degenerate situation, 
some of the Wi may be empty, and the corresponding ir,j will 
be zero. 

Each of the sub-walks W2 and W4 are "oblivious" of 
the conditioning on Vi and vo (because they are sufficiently 
far from both and Proposition [23] applies). In particular, the 
distribution of each vertex on W4 is point-wise close to [i. 
Therefore, under our conditioning the probability that each 
such vertex belongs to A is at most |A| (c/n + 6) < 2dc/n. 
The argument on W2 is similar, but more care is needed. 
Without the conditioning on Vi, each vertex on W2 has an 
almost-stationary distribution. Moreover, by Proposition [23] 
the conditioning on w 2 changes this distribution by up to 
5' := 2 /(3cn) < 1/n at each point. Altogether, for each 
j e {T(n) + 1, ... ,£ - 1}, we have 

Pr[vj e A \ Vi = v 7 v = u] < \A\(c/n + 6 + S') 

< 2dc/n. 

Using a union bound on the number of steps, we conclude 
that 7T2 + 7T4 < 2dct/n. 



Pr[v l+ i G A I Vi = v,v Q — u] < d/D. 



Similarly, 



Pr[v i+2 G A I Vi = v,v = u,v i+ i] < d/D, 
regardless of Vi+i which means 

Pr[v l+2 G A\ Vi = v,v = u] < d/D, 
and in general, 
(Vj = i + 1, . . . , p), Pr[vj G A I Vi = v, v = u] < d/D 

Similarly we have, 

Pr[«i_i G A I v t = v] < d/D, 



(4) 



and by Proposition 23 (and time-reversibility), conditioning on 
vq changes this probability by at most dS'. Therefore, 

Pr[-y 4 _! G A I Vi — v, v = u] < d/D + dS' , 

and in general, 

(Vj = t,...,i-1), 

Pr[vj G A I Vi = v, v = u] < d/D + dS' . (5) 

Altogether, using a union bound and by combining Q and 
d5}, we get that 

7T 3 < 2dT(n) /D + dT(n) /n < MT{n) /D. 

Using the same reasoning, tti can be bounded as 

7Ti < dT(n)/D + dT(n)/n < 2dT(n)/D. 

Finally, we obtain 

Pr[2? I Vi = V, Vq — V,] < 7Ti + 7T 2 + 7T 3 + 7T 4 

5dT(n) 2dct 
• (6) 



D n 

Our next step is to relax the conditioning on the starting point 
of the walk. The probability that the initial vertex is in A is 
at most d/n (as this happens only when the initial vertex is 
taken randomly), and by Proposition [23] conditioning on Vi 
changes this probability by at most dS' < d/n. Now we write 

Pr[S \vi = v]< Pt[v G A] + Pr[B \v t =v,v <£ A] 

< Pv[Vq G A] + 7Ti + 7T 2 + 7T 3 + 7T 4 

hdTin) Add 
D n 

where we have used the chain rule in the first inequality, and 
Proposition 18 with k = 1 for the second one. Now, since 
Pr[G] is very close to 1, conditioning on this event does not 
increase probabilities by much (say no more than a factor 1.1). 
Therefore, 



Pr[B I = v, G] < 



hl ( 5dT(n) 



Add 



\ D 



Now in the probability space conditioned on G, define events 
Gi, i = 2T(n) + l,...t, where Gi is the event that Vi = v. 



to 



Note that the intersection of more than k of the Q\ is empty 
(as conditioning on Q implies that the walk never passes v 
more than k times), and moreover, the union of these is the 
event that the walk passes v. Now we apply Proposition 18 to 
conclude that 



Pr[B | v € W,G] < 1.1k 



(bdT(n) 



D 



4dcA 
n ) 



O c z T{n) 



/5dT{n) 
\ D 



4dct\ 
n J 



By taking D = n(c 2 dT 2 (n)) and t = O (n / c 3 dT \n)) we can 
make the right hand side arbitrarily small (say at most 1/2). 
Now we get back to ([3]) to conclude, using Proposition [20] 
that 



t 



cnT(n) 



= n 



1 

c 4 dT 2 (n) 



Similarly, we can bound the edge-related probability 7r e .s as 
in the following lemma. The proof of the lemma is very similar 



to that of Lemma 24 and is therefore skipped for brevity. 

Lemma 25. There is a D = 0(c 2 dT 2 (n)) and t 2 = 
0(nD/c 3 dT(n)) such that whenever D > Do, by setting the 
path lengths t :— t 2 in Design 2 the following holds. Let 
B C E be a set of at most d edges in G, and e G E, e ^ B. 
Then 

*•>* = n (^>y) • (7) 

In Designs 3 and 4, the quantities tt^a and defined in 



Definition 11 play a similar role as tt Vi a and Tr e b- In order to 



prove disjunctness of the matrices obtained in Designs 3 and 4, 
we will need lower bounds on and ir^g as well. In the 
following we show the desired lower bounds. 

Lemma 26. There is a Dq = 0(c 2 dT 2 (n)) such that when- 
ever D > Dq, in Design 3 the following holds. Let v € V, and 
A C V be a set of at most d vertices in G such that v ^ A 
and A U {v} is disjoint from {s\, . . . s r , u}. Then 



c s d 2 T 4 (n) 



(8) 



']24 



Proof: Let Dq and t\ be quantities given by Lemm; 
Let Wo denote the start vertex of a walk performed in Design 3 
and consider an infinite walk W = (vq, «i, 1)2, . . .) that starts 
from a vertex identically distributed with wq. Let the random 
variables i,j,k respectively denote the times that W visits 
v, u, and any of the vertices in A for the first time. Therefore, 
Vi = v, vj = u, and g A, v t ^ v for every t < i and so 

on. Then the quantity tt^ a mat we wish to bound corresponds 
to the probability that i < j < k, that is, probability of the 
event that in W, the first visit of v occurs before the walk 
reaches the sink node u for the first time, and moreover, the 
walk never hits A before reaching u. Observe that this event 
in particular contains the sub-event that i < t\, t\ < j < 2ti, 

3 In fact, as will be clear by the end of the proof, Lemma |24| should be 
applied with the sparsity parameter d + 1 instead of d. However, this will 
only affect constant factors that we ignore. 



and k > 2t\, where t\ is picked as in Lemma 24 Denote 
by W C V tl+1 the set of all sequences of t\ + 1 vertices of 
G (i.e., walks of length t{) that include v but not any of the 
vertices in A U {u}. Now, we can write 



» 



(9) 



Pr[i < j < k] 
> Pr[i < ti < j < 2h < k] 
= Pr[(t < ti) A (j > h) A (k > ti)] ■ 
Pr[«i < j < 2*i < k | 

(*<*i)A(j>*i)A(A;>ti)] 
= Pii(v ,...,v tl )eW}- 

Pr[tj <j<2h<k \ (v ,...,v tl ) £ W] 

(10) 

The probability Pr[(«o, ■ ■ ■ ,v tl ) E W] is exactly n v> Au{u} 
with respect to the start vertex wo- Therefore, Lemma 24 gives 
the lower bound 

Pr[(, ,...,, tl)eW ]=o(- i ^ (n) 

Furthermore observe that, regardless of the outcome 
(v , ■ ■ ■ ,v tl ) 6 W, we have 

Pr[ii < j < 2tx < k | wo, . 



where ir u a is taken with respect to the start vertex v tl 
Therefore, since v tl ^ A U {u}, again we can use Lemma 24 
to conclude that 

Pr[h <j<2h<k\ K ...,v tl )eW]=Sl (^2^j 

By plugging the bounds in (jTOJ the claim follows. ■ 
A similar result can be obtained for Design 4 on the edges. 
Since the arguments are very similar, we only sketch a proof. 

Lemma 27. There is a D Q = 0{c 2 dT 2 (n)) such that when- 
ever D > Dq, in Design 4 the following holds. Let B C E be 
a set of at most d edges in G, and e€-E, e ^ B. Then 

c 9 d 2 DT 4 (n)j ' (U) 



» 



9. 



Proof: (sketch) Similar to the proof of Lemma 26 



consider an infinite continuation W = (v , v±, . . .) of a walk 
performed in Design 4 and focus on its first t\ + <2 steps, 
where t\ and t% are respectively the time parameters given by 
Lemmas [24] and [25] Let 

Wi := (v , ...jWtJ, 

W 2 := (v tl+ i,...,v tl+ t 2 ). 

Again following the argument of Lemma [26] we lower bound 
71-^ by the probability of a sub-event consisting the intersec- 
tion of the following two events: 

1) The event E\ that W\ visits e but neither the sink node 
u nor any of the edges in B, and 

2) The event 82 that W2 visits the sink node u but none of 
the edges in B. 

Consider the set A C V consisting of the endpoints of the 
edges in B and denote by v E V any of the endpoints of 
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e. Let p := ir Vt A (with respect to the start vertex vq). Now, 
Pr[£i] > p/(cD) since upon visiting v, there is a l/deg(«) 
chance that the next edge taken by the walk turns out to be e. 
The quantity p in turn, can be lower bounded using Lemma 24 
Moreover, regardless of the outcome of W%, the probability 
that W2 visits u but not B (and subsequently, the conditional 
probability Pr[£2 | £1]) is at least the probability 7r e < .b (with 
respect to the start vertex v tl ), where e' 6 E can be taken as 
any edge incident to the sink node u. This latter quantity can 



be lower bounded using Lemma |25] Altogether, we obtain the 
desired lower bound on tt 



(u) 



Remark 28. It is natural to ask whether the exponent of d 2 
in the denominator of the lower bound in Lemma [26] can 
be improved. We argue that this is not the case in general, 
by considering the basic where the underlying graph is the 
complete graph K n and each walk is performed starting from a 
random node. Consider an infinite walk W starting at a random 
vertex and moreover, the set of d+2 vertices A' := Au{u, v}. 
Due to the symmetry of the complete graph, we expect that 
the order at which W visits the vertices of A' for the first time 
is uniformly distributed among the (d + 2)! possible orderings 
of the elements of A'. However, in the event corresponding to 
jt|"|, we are interested in seeing v first, then u, and finally 
the elements of A in some order. Therefore, for the case of 
complete graph we know that n^A = 0(l/d 2 ), and thus, the 
quadratic dependence on d is necessary even for very simple 
examples. 

Remark 29. Another question concerns the dependence of 



the lower bound in Lemma 27 on the degree parameter D 



Likewise Remark 28 an argument for the case of complete 



graph suggests that in general this dependence cannot be 
eliminated. For edge group testing on the complete graph, we 
expect to see a uniform distribution on the ordering at which 
we visit a particular set of edges in the graph. Now the set of 
edges of our interest consists of the union of the set B U {e} 
and all the n — 1 edges incident to the sink node u, and is thus 
of size 71 + d. The orderings that contribute to 7Tg ^ must have 
e as the first edge and an edge incident to u as the second 
edge. Therefore we get that, for the case of complete graph, 

^l = 0(l/n) = 0(l/D), 

which exhibits a dependence on the degree in the denominator. 

Now, we are ready to prove our main theorem. 

We prove the first part of the 



Proof of Theorem 13 



theorem. Proofs of the other parts follow the same reasoning. 
The high-level argument is similar to the well known proba- 
bilistic argument in classical group testing, but we will have 
to use the tools that we have developed so far for working out 
the details. By construction, the output matrix M is vertex- 
consistent with G. Now, take a vertex v E V and A C V such 
that v ^ A, \A\ < d, and ({v} U A) n {s x , . . . , s r } = 0. For 
each i = l,... mi, define a random variable Xi € {0, 1} such 
that Xi = 1 iff the ith row of M has a 1 entry at the column 
corresponding to v and all-zeros at those corresponding to the 
elements of A. Let X := Note that the columns 

corresponding to v and A violate the disjunctness property 



of M iff X = 0, and that the Xi are independent Bernoulli 
random variables. Moreover, 

E{X l }=Pr[X l = 1] =7T V , A , 

since Xi = 1 happens exactly when the ith random walk 
passes vertex v but never hits any vertex in A. Now by using 



Lemma 24 we can ensure that, for an appropriate choice of 
Do and t\ (as in the statement of the lemma), we have 7r„ a = 
ft(l/(c 4 dT 2 (n))). 

Denote by the failure probability, namely that the 
resulting matrix M is not rf-disjunct. By a union bound we 
get 



(dlog^) • [ l-o 



v,A 

< exp 
Thus by choosing 



c 4 dT 2 {n) 



mi = O (d 2 c 4 T 2 (n) log 



we can ensure that Pf = o(l), and hence, M is d-disjunct 
with overwhelming probability. 

For the claim on (d, ei)-disjunctness, note that a failure 
occurs if, for some choice of the columns (i.e., some choice 
of v, A), we have X < e\. Set 



77' := r]7r v a = ^ 



n 



s c A dT 2 (n) J ' 

and ei := rjrn!^. Note that E[X] = n Vt A , m'i- Now by a 
Chernoff bound, we get 

/ (Em-7/m') 2> 
Pr[X < 7/771;] < exp -- 



2E[Z] 

= exp(-E[X](l- 77) 2 /2). 

So now, by a union bound, the failure probability pf 
becomes 

Pf < exp (dlog^- mUl-Ti) 2 ^), (12) 

where n is the lower bound £l(l/(c 4 dT 2 (n))) on ir Vt A- Thus 
we will have pj = o(l) by choosing 

m'^O^log^T 2 ^)/^-^ 2 ). 



Proof of Theorem [77} The proof follows line-by-line 
the same arguments as in the proof of Theorem [T3] except 
that for the last union bound it would suffice to enumerate a 
substantially lower number of choices of v and A. In particular, 



consider Design 1 as in the proof of Theorem 13 (the argument 



for the other designs is essentially the same). Then the only 
part of the proof that needs to be changed is the union bound 
from which ( fT2| ) follows. Contrary to the proof of Theorem 13 
in the case we consider here, only up to n choices of the tuple 
(v, A), in particular the following set, need to be enumerated: 

B:={(v,A):v<=V\{ Sl ,...,s r },A = S\{v}}. 

Now assume that the resulting matrix "satisfies" all the choices 
of the tuples (v, A) G B, in that it has enough rows at which 
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the entry corresponding to v is 1 while those corresponding to 
A are all zeros (this is guaranteed to hold, with overwhelming 
probability, by the union bound). 

Consider the case where S' ^ S and take any v E S' \ 
S. Since (v, S) € B, we can be sure that the measurement 
outcome corresponding to S' would be positive at more than 
f2(7ei) of the positions while at those positions, the outcome 
of S must be zero. A similar argument is true for the case 
S' C S, in which case it would suffice to take any v G S\S' 
and observe that (v, S \ {v}) € B. 

Altogether, from the above observations, the estimate ( fT"2"| ) 
can be improved to 

Pf < exp (log n — m(l — T/) 2 7r/2) , 

where rfi is the number of measurements. Therefore, we can 
ensure that pj = o(l) by taking m = O^m^), i.e., a factor 
7 less than what needed by Theorem [13] ■ 



V. Proof of Theorem[T61 
In Theorem [16] we consider two important instantiations of 



the result given by Theorem 13 namely when G is taken as an 
expander graph with constant spectral gap, and when it is taken 
as an Erdos-Renyi random graph G(n,p). In the following we 
show that in both cases (and provided that p is not too small), 
the mixing time is O(logn) (with probability 1 — o(l)). Then 



Theorem 13 will lead to the proof. 



Before we proceed, we need to bound the distance between 
the stationary distribution and the distribution obtained after 
t random steps on a graph. The following theorem, which is 
a direct corollary of a result in ll20l . is the main tool that we 
will need. We skip the proof of this theorem here and refer to 
the main article for interested readers. 

Theorem 30 ([ 20 1). Let G be an undirected graph with 
stationary distribution /x, and denote by <i m i n and d m&K the 
minimum and maximum degrees of its vertices, respectively. 
Let /i* be the distribution obtained by any random walk on G 
in t steps starting at node v. Then for all v G V 

||/4 - Hloo < (1 - $(G) 2 /2)*dmax/dmin, 

where Q(G) denotes the conductance of G as in Definition^ 

A. The Erdos-Renyi random graph 

First, we present some tools for the case of random 
graphs. Consider a random graph G(n,p) which is formed 
by removing each edge of the complete graph on n vertices 
independently with probability 1 — p. Our focus will be on 
the case where np ^> \nn. In this case, the resulting graph 
becomes (almost surely) connected and the degrees are highly 
concentrated around their expectations. In particular, we can 
show the following fact, which we believe to be folklore. The 



proof of this is presented in Section VI-G 



Proposition 31. For every e > 0, with probability 1 — o(l), 
the random graph G(n,p) with np > (2/e 2 )lnn is (np(l — 
e), (1 + e)/(l — e))-uniform. 



In light of Theorem [30] all we need to show is a lower 
bound on the conductance of a random graph. This is done in 
the following. 

Lemma 32. For every tp < 1/2, there is an a > such 
that a random graph G = G(n,p) with p > alnn/n has 
conductance &(G) > ip with probability 1 — o(l). 

Proof: First, note that by Proposition [5T| we can choose 
a large enough so that with probability 1 — o(l), the degree 
of each vertex in G is between D(l — e) and D(\ + e), for an 
arbitrarily small e > and D := np. We will suitably choose 
e later. 

Fix a set S C V of size i. We wish to upper bound the 
probability that S makes the conductance of G undesirably 
low, i.e., the probability that E(S,S) < ipA(S). Denote this 
probability by p$. By the definition of conductance and (D,e)- 
uniformity of G, we only need to consider subsets of size at 
most rjn, for 77 := (1 + e)/2(l — e). 

There are i(n — i) "potential" edges between S and its 
complement in G, where each edge is taken independently 
at random with probability p. Therefore, the expected size of 
E(S,S) is 

v := Di(l - i/n) > Di(l - 17). 
Now note that the event E(S, S) < <pA(S) implies that 

E{S,S) < tpDi(l + e) < ip'v, 

where tp' := ip(l + e)/(l — r}). So it suffices the upper bound 
the probability that E(S, S) < <p'v. Note that, since tp < 1/2, 
we can choose e small enough to ensure that tp' < 1. Now, by 
a Chernoff bound, 

Ps < Pr[E(S, S) < y'v] 

<exp(-(l-^)M 
< n- ia(1 - v '" )2{1 ~ n) . 

Set a large enough (i.e., a > 2/(1 — tp') 2 (l — 77)) so that 
the right hand side becomes at most n~ 21 . Therefore, with 
high probability, for our particular choice of S we have 
E{S,S)/p(S) > (p. 

Now we take a union bound on all possible choices of S to 
upper bound the probability of conductance becoming small 
as follows. 



Pr[$(G) < ip] < 



rjn 

n 11 



< 



i/n 



n~ l = o(l). 



Thus with probability 1 — oil), we have ^(G) > tp. 

■ 

By combining Lemma [32] and Theorem [30] we get the 



following corollary, which is formally proved in Section VI-H 



Corollary 33. There is an a > such that a random graph 
G = G(n,p) with p > ahin/n has S-mixing time bounded 
by 0(log(l/<5)) with probability 1 — o(l). 

In particular, for our specific choice of 5 := (l/2cn) 2 , the 
5-mixing time of G(n,p) would be T(n) = O(logn). 
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B. Expander Graphs with Constant Spectral Gap 

Similar to Corollary [33] we need to show that the mixing 
time of an expander graph with second largest eigenvalue that 
is bounded away from 1 is bounded by O(logn). 

Lemma 34. If G = (V, E) is an expander graph with a 
( normalized) second largest eigenvalue that is bounded away 
from 1 by a constant, then T(n) = O(logn). 

Proof: We first recall a well known result in graph 
theory (cf. lfT9lO . which states that any regular graph with a 
normalized adjacency matrix whose second largest eigenvalue 
(in absolute value) is bounded away from 1 must have good 
expansion (i.e., a = Sl(l)). 

Moreover, note that for regular graphs we have A(S) — 
D\S\, and therefore the two notions of conductance (Definition 
[7]i and expansion (Definition [9]l coincide (except a multiplica- 
tive constant). 



Finally, we can applying Theorem 30 to find the smallest t 
which satisfies 

1 



(l-$(G) 2 /2)* < 



{2nf 



which implies T(n) = 0(\ogn). ■ 

We now have all the tools required for proving Theorem [To] 



Proof of Theorem\16 
Theorem [T3] Proposition 



Follows immediately by combining 
3TJ Corollary [33] and Lemma [34] ■ 
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VI. Appendix 



A. Proof of Proposition 18 
We can write 



Pr[A | B] = 



< e • 

< ek 



Pt[B] 
Pt[B] 



The last inequality is due to the fact that each element of the 
sample space can belong to at most k of the Bi, and thus, 
the summation P r [^i] counts the probability of each 

element in B at most k times. 

B. Proof of Proposition [79] 

We start with a well-known result [17, Theorem 7.13], that 
a random walk on any graph graph G that is not bipartite 
converges to a stationary distribution ft, where 

d(v) 



fi(v) = 



2\E\ 



Since G is a (D, c)-uniform graph we know that D < d(v) < 
cD and that nD < 2\E\ < ncD. 

C. Proof of Proposition [20] 

Let t' := \t/T(n)\, and for each i £ {0, ...,t'}, Wi := 
v iT(n)- Denote by W := {u>o, . . . ,w t '} a subset of t' + 1 
vertices visited by W . Obviously, tt v is at least the probability 
that v 6 W'. Thus it suffices to lower bound the latter 
probability. 

By the definition of mixing time, regardless of the choice 
of Wo, the distribution of W\ is <5-close to the stationary 
distribution ft, which assigns a probability between 1/cn and 
c/n to v (by Proposition 19 1. Therefore, Pr[wi ^ v \ wq] < 
1 — 1/cn + 5. Similarly, Pr{w 2 ^ v \ w ,wi] < 1 — l/2cn, 
and so on. Altogether, this means that 



Pr[u> ^v,wi ^ v, 



■,wt> ^v] < (1- l/cn + Sf^^ 

< (1 - l/2cn) t / T (") 

< exp(-t/(2cnT(n))) 

< 1 - n(t/(cnT(n))). 

In the last equality we used the fact that exp(— x) < 1 — x/2 
for < x < 1. Thus the complement probability is lower 
bounded by Vt{t / (cnT (n)) . The calculation for 7r e is similar. 
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D. Proof of Proposition \21\ 

For every i — 0, . . . , t, define a boolean random variable 
X t £ {0, 1} such that X t = 1 iff v t = v. Let X := £* =0 X t 
be the number of times that the walk visits v. For every i > 
T(n), we have 

E[X t ] = Pr[vi = v] 

< c/n + S 

< 2c/n, 

where the first inequality is due to the assumption that 
i > T(n) and after the mixing time, the distribution induced 
on each vertex is within 6 of the stationary distribution, and the 
second inequality is by the particular choice of the proximity 
parameter i5. Define X' := Y^\=T{n) -^i- By linearity of 
expectation, E[X'] < 2ct/n, and by Markov's inequality, 



Pr[X' > ac 2 T(n)} < 



2t 



acnT(n) 



By taking a a large constant (depending on the constant hidden 



in the asymptotic estime of n v given by Lemma 20 1, and using 
Proposition [20] we can ensure that the bound on the probability 
is at most ir v /4. Thus the probability that X > k for k := 
(1 + ac 2 )T(n) is at most ir v /4. Proof for the edge case is 
similar. 



G. Proof of Proposition [57] 

Let a := 6/e 2 so that np > a Inn. Take any vertex v of the 
graph. The expected degree of v is np. As the edges are chosen 
independently, by a Chernoff bound, the deviation probability 
of deg(w) can be bounded as 

2 np/3 



Pr[| deg(w) — np\ 



> enp] < 2e 

< 2n-^ a / 3 = 2/ 



n 2 . 



This upper bounds the probability by 2-nT 2 . Now we can use 
a union bound on the vertices of the graph to conclude that 
with probability at least 1 — 2/n, the degree of each vertex in 
the graph is between np{l — e) and np(l + e). 



H. Proof of Corollary 33 



Choose a large enough so that, by Proposition 3 1 the graph 
becomes (np(l — e), (l + e)/(l — e))-uniform, for a sufficiently 



small e and so that Lemma 32 can be applied to obtain $(G) 



57(1). Let \il be the distribution obtained by any random walk 
on G in t steps and denote by u the stationary distribution of 
G. Now Theorem 30 implies that, 



||//-«lloo<(l-<i>(G) 2 /2)*(l + 6)/(l-e), 
and thus, it suffices to choose t = 0(\og(l/S)) to have 



E. Proof of Proposition 22 



By the choice of v (that is not a designated vertex), the 
walk W has a chance of visiting v as the initial vertex vq 
only if it starts at a vertex chosen uniformly at random. Thus 
the probability of visiting v at the initial step is 1/n < 1/D. 

Now, regardless of the outcome of the initial vertex vq, the 
probability of visiting v as the second vertex vi is at most 1/D, 
as vq has at least D neighbors and one is chosen uniformly 
at random. Thus, Pr[vi — v] < 1/D, and similarly, for each 
i, Pi[vi — v] < 1/D. A union bound gives the claim. 
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F. Proof of Proposition 23 
We can write 



Prfc = 



Vi = U \ V, 



= v,£] = Pr[vj = v | Vi = u, £] 



Prfo = 
Pr[ Vj = 



Now, from the definition of mixing time, we know that 



Pr[fj = v 



= u,£]-Pi[v j =v\£}\< 25, 



because regardless of the knowledge of Vi = u, the distribution 
of Vj must be 5-close to the stationary distribution. Therefore, 

|Pr[w 4 = u\vj =v,£]- Pr[u j; = u\£]\ < 2&/Pr[v j =v\£] 

< 25/(l/cn-6) 

< 8Scn/3. 
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